Chaoyang Wang

TAP-JEPA: Frozen Future-Latent Probing and Two-Stage Score Fusion for EPIC-KITCHENS-100 Action Anticipation

May 30, 2026

FROST-STA: Frozen Dense Features for the Ego4D Short-Term Object Interaction Anticipation

May 30, 2026

Helix4D: Complex 4D Mesh Generation

May 25, 2026

One-Step Distillation of Discrete Diffusion Image Generators via Fixed-Point Iteration

May 20, 2026

Rethinking Vector Field Learning for Generative Segmentation

Mar 19, 2026

VLA-Thinker: Boosting Vision-Language-Action Models through Thinking-with-Image Reasoning

Mar 15, 2026

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Feb 05, 2026

Revisiting Diffusion Model Predictions Through Dimensionality

Jan 29, 2026

A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice

Dec 23, 2025

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Dec 19, 2025